A nonenumerative algorithm to find the 
k longest (shortest) paths in a DAG 

Fatih Ko^an 

Abstract 

In this paper, we present a novel and efficient algorithm to find the k longest (shortest) 
paths between sources and sinks in a directed acyclic graph (DAG). The algorithm does 
not enumerate paths therefore it is especially useful for very large k values. It is based 
on the Valued-Sum-of-Product (VSOP) tool, which is an extension of Zero-suppressed 
Binary Decision Diagrams (ZBDDs). We assessed the performance of this algorithm 
with a DAG model of a path-intensive combinational circuit, viz. c6288, that has 
~ 10 20 paths. We found that it took about 64 minutes to compute all paths in this 
DAG along with their lengths. 

1 Introduction 

The classical problem of finding the shortest or longest paths in a directed graph 
has been generalized to find the k shortest or k longest paths. In the k shortest 
path problem, the paths are ranked in the increasing order of their lengths while 
in the k longest path problem, the paths are ranked in the decreasing order 
of their lengths. The k shortest (longest) path problem is to find k shortest 
(longest) paths between two given vertices s and t in a directed graph with non- 
negative edge weights for a positive k value. The shortest (longest) paths can 
be simple or with loops. There are two well-known k shortest nonsimple path 
finding algorithm. One of them achieves 0(m + kn logn) [4] and the second one 
achieves 0(m + n log n + k) [3] run times. 

The problem of finding the k shortest simple paths has been studied in 
P3 El HI [ED HH1 H 12] • The best asymptotic performance has been obtained in 
[171118] and it achieves 0(kn(m + n log n)) worst-case run time. The work in [5 J 
outperforms the algorithms with the best known results, esp. for large graphs. 
It can be noticed that when k is very large, such as k = 10 15 , the algorithms 
become impractical. 

There are also studies to compute the k longest paths in DAGs Q]2 [Hfl [5].. 
All algorithms enumerative paths in the order of their lengths. In this paper, 
we focus on finding k longest paths in directed acyclic graph (DAG) with any 
integer weights. Later, we modify the algorithm to compute the k shortest 
paths, the k~th longest path, and the k — th shortest path. Our algorithm does 
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not enumerate paths and relies on the implicit calculation of paths and their 
lengths. Also, we aim at very large k values. 

Our algorithm is inspired by the recent usage of the ZBDD tool in path delay 
fault coverage (PDF) calculation [THl HH [H] and the availability of the Valued- 
Sum-of-Products (VSOP) tool [15], an extension to the ZBDD tool [7J. The 
algorithms are all nonenumerative algorithms. The algorithms may encounter 
memory overflow during their computations. In this case, the algorithms would 
be run on partitioned graphs to avoid memory overflow that causes the memory 
swapping, which in turn increase the run times 

The rest of the paper is organized as follows. Section II overviews the VSOP 
tool. Section III introduces the k longest (shortest) path finding algorithm. 
Section IV gives the experimental result for a path-intensive DAG. Finally, we 
conclude the paper. 

2 Valued-Sum-of-Product (VSOP) Tool based on ZBDDs 

A combinatorial item set is a set of elements each of which is a combination out 
of n items. Zero-suppressed BDDs (ZBDDs) are special type of BDDs that are 
designed for implicit representation and efficient manipulation of combinatorial 
item sets [7]. The manipulation time is proportional to the size of the underlying 
ZBDD. Therefore, ZBDDs utilize certain reduction rules to compactly represent 
a set. The size of a ZBDD is also sensitive to the order of items (i.e., variables) in 
a ZBDD. Many static and dynamic variable ordering approaches are investigated 
for that purpose. We explain a ZBDD with the help of an example. 

Let F — {abc, ab, be} be a combinatorial item set. The set of items in F are 
{a, b, c}. Figure [TJ illustrates the ZBDD for F. The items are ordered as a being 
the first, b being the second, and c being the third. There are two types of edges: 
0— edges (dashed line) and 1— edge (solid line). Each path from the root to leaf 
1 corresponds to a combination in the set. In a path, an solid (dashed) outgoing 
edge from a node indicates that the respective item is included (excluded) in 
(from) the combination. 

Valued-Sum-of-Product (VSOP) [TSJIH] is an extension to ZBDD, and sup- 
ports assignment of values to the terms in a sum-of-products expression and 
enables efficient manipulation of expressions. For example, E = 2ab + ac — 3bd 
is a VSOP expression where terms ab, ac, and bd have 2, 1, and —3 values, 
respectively. In VSOP algebra, addition and subtraction follow the ordinary 
rule (a + a = 2a) but the multiplication does not, and a x a = a, not a 2 . VSOP 
adopts the base (—2) binary encoding of numbers and an integer number is 
decomposed into an n— digit vector of ZBDD {Fq,Fx, . . . , F n _i}. Fi is a ZBDD 
and stores the terms which have a 1 in their (i + l) th digits in their encodings. 
Specifically, Fq is the set of odd- valued terms, F± is the set of terms which have 
a 1 in the second position etc. 

The VSOP expression F = 4a6c+ hab + 36c + a is represented in VSOP as in 
Table[T] The encoding yields Fq = {ab, be, a}, Fx = {be}, and F 2 — {abc, ab, be}. 

The following VSOP operations are used in the following sections and their 
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Fig. 1: ZBDD for F = {abc, ab, be} 



Tab. 1: Encoding values of F's terms 



F 


Value 


F 2 




Fo 


abc 


4(100) 


1 








ab 


5(101) 


1 





1 


be 


3(111) 


1 


1 


1 


a 


1(001) 
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details can be found in |14] . 

• Ei. Permit {E2): Extract terms in E\ each of which is included in one of 
combinations in E 2 . 

• Ei. Restrict {E 2 )'. Extract terms in Ei each of which includes one of com- 
binations in E 2 . 

• Ei.TermsOP (E 2 ): Filter terms in E x "OP" to the constant term of E 2 
where OP = {EQ, NE, LE, LT, GT, GE}. 

• E\.OP _Const(K): Filter terms in E\ whose terms are "OP" to constant 
K where OP = {EQ, NE, LE, LT, GT, GE}. 

• Ei op E 2 : Two expressions are subject to arithmetic or logical operation 
op, where OP = {+, *, /, %, ==, ! =, <, <=, >, >=}. 

• E.CountTerms (): The number of minterms in expression E. 

• E.TotalVal () , E.MaxVal () , E.MinVal (): Sum of the minterm values, 
maximum value in the set, and minimum value in the set. 

• I .GetlntQ: Convert VSOP value / to integer. 

• E.MinCoverQ: Return a minimum- valued term in E. 
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Tab. 2: VSOP examples 



Operation 


XlCSUiX 


F.Restrict(a) 


4abc + 5ab + a 


F.Restrict(ab) 


4a6c + 5a6 


t Mestrictya + b) 


^ T i f T i O 7 i 

4aoc + oao + ooc + a 


F.Permit(ab) 


5a6 + a 


F.Permit(abc) 


Aabc + bab + 36c + a 


F.Permit{c) 





r.Countl ermsy) 


4 




5 


F.Minv al{) 


1 


F.TermsGE(3) 


4afec + bab + 36c 


F.TermsLT(3) 


a 


F + G 


4a6c + lOab + a 


F-G 


Aabc + 6bc + a 


FxG 


babe + 30ab — 96c 


F==G 


ab 


F > G 


abc + be + a 


G > F 





F\=G 


a6c + 6c + a 


F.MinCover() 


a 


F.MaxCoverQ 


bab 



• E.MaxCoverQ: Return a maximum-valued term in E. 

Let F — Aabc + bab + 36c + a and G = bab — 36c. Some operations on F and 
G and their outcomes are tabulated in Table [2j Comparison operations are 
better understood if we assume coefficients for the nonexisting terms in G i.e.. 
G = Qabc + bab - 36c + Qa. 

3 Algorithm 

This section introduces the algorithm that store all or selected paths along with 
their lengths. Then, we select the k longest paths from this path database. 
Later, we explain the k shortest path finding algorithm. 

To find the k longest paths, we utilize a data-driven binary search (DDBS) 
algorithm (Algo. [TJ. Since sorting and index-based accessing to the paths 
in a VSOP is not possible, the DDBS algorithm uses nonenumerative VSOP 
operators to find the set of k longest paths from the built path database. Also, 
in this algorithm, when [ ma ^|""' j — rain, the search to find the set is repeated 
for min + 1 in the last step. Otherwise, the algorithm would not terminate. 

We present an algorithm (Algo. \2§ to find the k longest paths with VSOP 
tool nonenumeratively. Algorithm [2] starts from the source vertices and induc- 
tively builds the path database in the topological order of vertices. At each 
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Algorithm 1 TopK(K, £) 
Require: < K < |£| 

min 4— C.MiriValQ 
max <— C.MaxValQ 
mid_prev 
while !donc do 

mid_prev <— mid 
mid <— (rain + max)/2 

if Ctol _EQ(mid,mid_prev).GetInt() then 

mid <— mid + 1 

done 1 

break 
end if 

cl <— (C.EQ _Const(mid)).CountTerms() 
c2 (C.GT _Const(mid)).CountTerms() 
c3 <- cl + c2 

flag «- ci.GT _Const(K)) && c2.LT _Const(K) 

if (c3.£Q_Cor«!i(J0)||/Zaff) then 
done 1 
6rea/c 

else if (c3.LT _Const(K)).GetInt() then 
max mid 
continue 

else if (ci.GT _Const(K)).GetInt() then 

mm 4— mid 

continue 
end if 
end while 

return C.FilterThen(C.GE _Const(mid)) 



vertex, the algorithm builds the setO of partial paths that end at this node. The 
sum of sink nodes' paths is the set of all paths along with their lengths. In 
the algorithm, we optionally utilize early pruning of infeasible paths by calling 
TopK() algorithm with parameter k. 

In Algo. [2j £ and ds are VSOP expressions, and d == Ci removes the val- 
ues of terms in a VSOP expression. For example, ((2ab + 3bc) == (2ab + 36c)) =>• 
ab + be. The last loop takes the union of all paths that end at the sinks. After 
that path queries can be performed on all paths using VSOP operations. In our 
algorithm, we query the k longest paths. Inductive building of a set of paths is 
illustrated with the example below. 

Example: In Fig. [2J £2 — 0, and £3 = 0. Cjj is the length associated with 
edge ei.j. Thus, the partial paths at V7 along with their lengths are computed as 
follows. We initialize £7 = 0. Since V2 is a source vertex, £7 = £7 + 4 • t>2- Since 
«3 is a source vertex, £7 = £7+4-«3. After that £7 = Cj-vr = 4-V2-V7+4-V3-V7. 
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Fig. 2: A DAG with edge costs. 

We compute compute £$ as follows. Since v^ is a source, £4 = 0. Thus, £$ = 
£s + 4 • t>4 and £$ = Cs + £7 + (t>2 • v-j + V3 ■ Vj) ■ 2 = 4 • V4 + 6 • t>2 • vj + 6 • V3 ■ vj. 
Finally, £$ = £% ■ vs = 4 • W4 • v$ + 6 • t>2 • vj ■ v$ + 6 • V3 ■ vj ■ v$. We compute the 
partial paths at each vertex in topological order similarly for all vertices. 



Algorithm 2 Paths and their lengths: K-longest() 

Require: V[l..iV] : Topologically sorted vertices 
for i 4- 1; i < N; i 4- i + 1 do 

for each incidence vertex Vj of Vi do 
if Vj is a source then 

£% ^ £% Ci.j ' Vj 

else 

£% ^ £% ~\~ £j ~\~ Ci.j ' (£j 
end if 

TopK(k, d) - early prune code 
end for 

£% ^ £% ' Vi 

end for 

£ <- 

for each sink Vi do 

£ 4- £ + & 
end for 

longest <~ £.MaxVal() 
return TopK(k, £) 



By calling TopKQ algorithm after every partial path set calculation at each 
vertex we can eliminate infeasible partial paths early from the set of all partial 
paths at this vertex. A partial path that is not in the top K partial paths is an 
infeasible partial path. In the final k longest path set, there cannot be a path 
starting with an infeasible partial path. Therefore, it is safe to remove them 
from consideration. This early pruning of them would speed up the algorithm. 
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3.1 Finding the k shortest paths 

We modify the TopKQ algorithm (Algorithm [T|) to find the k shortest paths. 
All we need to: 

1. change GT into LT in calculation of c2, 

2. change max <— mid to min <— mid, 

3. change min <— mid to max <— mid, and 

4. change GE in the last statement to LE. 
The modified algorithm is given in Algorithm [3J 



Algorithm 3 TopK(K, C): shortest path version 
Require: < K < \C\ 

min <— C.MinValQ 
max <— C.MaxValQ 
mid_prev <— 
while !donc do 

mid _prev 4— mid 
mid {min + max)/2 

if Ctol _EQ(mid,mid_prev).GetInt() then 

mid 4— mid + 1 

done 1 

break 
end if 

cl (C.EQ _Const(mid)) .CountTermsQ 
c2 {C.LT _Const(mid)). CountTermsQ 
c3 <- cl + c2 

flag <- c3.GT_Const(K)) && cl.LT _Const(K) 
if (cZ.EQ_Const{K))\\flag) then 

done <— 1 

forea/c 

else if (c3.LT _Const(K)).GetInt() then 
min mid 
continue 

else if (ci.GT _Const(K)).GetInt() then 

mai <— mid 

continue 
end if 
end while 

return C.FilterThen(C.LE _Const(mid)) 
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Tab. 3: c6288 



Top K 


Nodes 


Time 


Mem% 


Paths 


5a;10 5 


20592319 


6:34 


4.5 


572,976 


10 e 


26921861 


8:33 


4.6 


1,485,955 


5xlO b 


36561672 


12:19 


8.9 


6,561,113 


10 7 


44147879 


15:31 


9 


16,629,545 


All 


326301920 


64:17 


70.6 


~10^ u 



3.2 Finding the k-th longest and k-shortest path 

After we find the k longest paths, we can query the k — th longest path among 
them. Let Z be the set of k longest paths. Z.minCoverQ would return a 
minimum length path from the set Z, which is the k — th longest path. 

Similarly, we can find the k shortest paths and query the k — th shortest 
path among them. Let Z be the set of k shortest paths. Z.maxCoverQ would 
return a maximum length path from the set Z, which is the k — th shortest 
path. 

Note that k — th shortest or longest path may not be unique. In that case, 
TopK() algorithm may return more than K paths as solution set. 

4 Experimental Result 

We implemented the proposed algorithms and assessed their performances using 
a DAG model of a path-intensive c6288 circuit benchmark [5] . This benchmark 
has 32 source and 32 sink vertices with a total of 2448 vertices and 4800 edges. 
There are 9.89434xl0 19 paths from sources to sinks in this benchmark. Ex- 
periment was performed on an Intel-86-based 64-bit processor with two dual 
2.66-GHz CPUs, and 20-Gbyte RAM with the Linux operating system. 

Table[3]tabulates the results for c6288. The edge weights are set to a number 
between 1 and 10 that is generated randomly. We calculate the number of used 
ZBDD nodes, time in minutes: seconds format, percentage of memory usage, and 
the number of paths in the query of top K paths for various values of K. For 
example, Algorithm [5] computes top 5xl0 5 paths in 6 minutes and 34 seconds 
with 4.5% of 20 GB memory. The algorithm allocates 20592319 ZBDD nodes 
and returns 572,976 longest paths. Note that we do not find exactly k paths 
when the k — th path is not unique. 

5 Conclusion 

In this paper we introduced a novel algorithm based on VSOP and ZBDD to 
extract the k longest (shortest) paths and the k — th longest (shortest) path 
from a DAG. This algorithm outperforms existing algorithm when the value 
of k is very large since the run-times of the algorithms become impractical for 
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very large k values. We assessed the performance of our algorithm with a path- 
intensive DAG that has ~ 10 20 paths. We were able to compute all paths and 
their lengths in about 64 minutes. 

The proposed algorithm can be used as a front end software to selected crit- 
ical paths for timing analysis. It can also used in the path delay fault coverage 
calculation to enable delay-sensitive coverage calculation. 
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